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Abstract 

We analyze different microscopic RNA models at zero temperature. We discuss both the most 
simple model, that suffers a large degeneracy of the ground state, and models in which the degen- 
eracy has been remove, in a more or less severe manner. We calculate low-energy density of states 
using a coupling perturbing method, where the ground state of a modified Hamiltonian, that repels 
the original ground state, is determined. We evaluate scaling exponents starting from measure- 
ments of overlaps and energy differences. In the case of models without accidental degeneracy of 
the ground state we are able to clearly establish the existence of a glassy phase with 9 ~ 1/3. 
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I. INTRODUCTION 



RNA plays a fundamental role in the biochemistry of all living systems and it is 
commonly believed to be at the origin of the pre-Darwinian epoch of life 0. Much like for 
DNA, the RNA primary structure can be described in terms of strings of the four letter 
alphabet composed by Adenosine, Citosine, Guanine, Uracile (ACGU). Since RNA is usually 
found in the single stranded pattern, formation of double-helix regions is accomplished by 
the molecule folding back onto itself to form Watson- Crick (WC) base pairs G=C and A=U, 
or the slightly less stable G— U pair. One of the most intriguing features of RNA folded 
secondary structures is that in most cases the connectivity graph is planar: This property 
greatly reduces the computational efforts needed for calculating the ground state structure. 

It might be asked whether secondary structures provide an adequate level of description 
for RNA real molecules [^. It is believed that secondary structure description is biologically 
relevant for a number of reasons: Base pairing and base pair stacking provide the major 
part of the free energy of folding secondary structures have been used successfully by 
biologists in the interpretation of RNA function and activity 0, and because structures are 
conserved in evolutionary phylogeny. At the same time computer scientists find this level 
of description rather appealing since secondary structures are discrete and therefore easy to 
compare. Moreover, thanks to the planarity condition, efficient recursive algorithms for the 
computation of the native (ground state) structure are easily implemented [Q. 

Beside the genuine biological interest of RNA models, recently this subject has raised 
considerable attention as an intriguing problem in statistical mechanics of disordered sys- 
tems. The focus is now set on the presence and the nature of a low temperature phase in 
ensembles of random sequences. In a series of recent papers H, 0, |], ^ |10|, |ll], |12| different 
authors have presented a number of evidences (mostly numerical) supporting the existence 
of a transition to a glassy phase (for a review see Ref. ||13|)- While a careful study of the 
equilibrium thermodynamics of the model suggests a smoother than second order phase 
transition 0, there is still much debate about the nature of the low temperature phase, 
since finite size scaling corrections are very hard to keep under control. It has been shown 
in 1^, 1^ that, at least for systems up to 1000 bases, a broad overlap distribution characterizes 
the low temperature phase, but a safe extrapolation to the thermodynamic limit from the 
available data is still out of control H, |9ll. 
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Bundschuh and Hwa [|T0|, |TT] have presented an extensive study on a variety of similar 
RNA models supporting the existence of a low temperature glassy phase. They were able 
to show analytically, via a two replica calculation, that weak quenched sequence disorder is 
equivalent to a high temperature phase in which all replicas are independent (molten phase), 
and that there must be a finite temperature below which replicas start feeling themselves 
as in a strong coupling regime. Numerically the authors established this glassy transition 
measuring the free energy cost of imposing a pinch between two bases. They observed that 
the energy of the pinching excitation (with respect to the ground state) increases with the 
sequence length following a logarithmic law (even if a power law with small exponent was 
not excluded). 

In this paper we study the scaling regime of the lowest energy excitations in different 
models of random RNA secondary structures. Following an idea put forward in 0, we use 
a perturbing method |15[ which has been very valuable in the study of low temperature 
properties of disordered systems |T^: In the following we will call it the e-coupling method. 



Very recently the same procedure has been followed by Krzakala, Mezard and Miiller [12 



We will comment on their results in the concluding section after having presented our data. 

The goal of the e-coupling method is to calculate the energy cost of typical excitations 
above the ground state involving a finite fraction of the system. As in the droplet model |T7 
these energy excitations are assumed to scale as AE{L) cc L^, L being the length of the 
molecule, and 6 being a relevant exponent, that we would like to determine. It is well possible 



that also the local pinching of coupled replicas imposed in [I^, |TT[ would generate "typical" 
configurations, but we believe that our method based on a bulk perturbation will surely do 
so (in this sense we find very illuminating its application). We apply a perturbation that 
is simply a repulsion term from the ground state structure, which forces the system to find 
new low-energy structures far away from the one of the original ground state, without any 
other constraint. 

The paper is organized as follows: in section II we introduce the formalism for describing 
the RNA secondary structure and the different ways we have used to remove the ground 
state degeneracy intrinsic to the original model |^. In section ^ we sketch the method 
we have applied for calculating the low-energy spectrum of the model. We also discuss the 
measurable observables. In section ^ we present our results, focusing on the differences 
and similarities among the models we have introduced. Finally, in section |V|, we summarize 
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our findings, we compare with previous work and we comment on further developments. 



II. MODELS 

The secondary structure of RNA is the set of base pairs that occur in its three-dimensional 
structure. Let us define a sequence of basis as 7^ = {ri,r2, ...,rn}, Vi being the i^'^ base of 
the chain and G {A, C,G,U}. A secondary structure on 71 is now defined as a set S of 
pairs (with the convention that 1 < i < j < L) satisfying the following rules: 

• j — i > 4 : this restriction permits flexibility of the chain in its three-dimensional 
arrangement; 

• two different base pairs {i, j), {i' , j') G 5 if and only if (without loss of generality 
we can assume that i < i') i < j < i' < j', i.e. the pair precedes or 
i < i' < j' < j , i.e. the pair includes {i' ■,]')■ This rule, called planarity condition, 
excludes the occurrence of the so-called pseudo-knots, which are very unlikely in real 
RNA. 

We consider a simplified model for RNA folding, very similar to the one studied in [|^. 
The model is described in terms of the Hamiltonian: 

7i = ^ Cij = ^ ^ij^ij y (1) 

where Cij is the pairing energy between bases i and j and the variable iij takes value 1 
if {i,j) G S and otherwise. On a first approximation one can assume that the pairing 
energies depend only on the paired bases, Cij = e{ri,rj). Reasonable values for the energies 
e{ri,rj) of the allowed base pairs (C-G, A-U and G-U) at room temperature are of C(l) 
kcal/mole One could consider other phenomenological parameters in order to take into 
account the whole complexity of a realistic energy function @, [18|. 



We have assumed a drastic approximation in order to get a tractable model both from a 
numerical and analytical point of view. We consider sequences made of 4 symbols (A, C, G 
and U) and we assume that only Watson-Crick base pairs may occur: we use a strong C-G 
coupling of energy —2 (in arbitrary units) and a weak A-U coupling of energy —1. All the 
other possible couplings increase the energy, so that the system avoids these links. One of 
the advantages of this model is that the role of the disorder (encoded in the random sequence 



TZ) is clearly separated from that of the frustration (induced by the planarity condition on 
the structure S). 

This 4 letters model has an exponentially large ground state degeneracy, which gives a 
finite T = entropy (as already found in ^ for the 2 letters model). For this reason we 
refer to it as the degenerate model (the D model). 

The large ground state degeneracy that occurs in this D model is a pathology of a frus- 
trated models with simple discrete interactions: Since the couplings can take only the two 
negative values —2 and —1 the same exact energetic situation can be realized in many ways. 
This accidental degeneracy will probably not play a relevant role in the physical RNA: Since 
real RNA energy function is far more complex than that, ground state degeneracy is un- 
likely to occur. Because of that we define two new models with modified pairing energies, in 
order to remove the degeneracy. In both models this aim is accomplished by adding a small 
random perturbing term rjij to the pairing energies: Cij Cij + rjij. 

In the quasi-degenerate model (the QD model) the rjij are i.i.d. variables extracted from 
a Gaussian distribution of zero mean (rj) = and variance (r^^) = rj^/L, with rjQ a small 
and finite constant of the order of 0.1: When L ^ oo the pairing energies are modified of 
an infinitesimal amount. The variance is chosen such that the energies of the ground states 
(which are degenerate for r^o = 0) are split over an 0{t]o) range. In this way we preserve 
somehow the structure of the original energy spectrum and the sequence still plays a key 
role, but the unphysical degeneracy is lifted and the ground state is now unique. 

In the non-degenerate model (the ND model) the variance of the rjij variables is finite, 
{t]'^) = rjl (we use r]Q = 0.1). This variance induces an 0{\^) splitting of the degenerated 
ground states, which has to be considered as a strong reshuffling of the original energy 
spectrum, since the energy gaps among levels in the original model were of 0{1). The 
resulting energy landscape now depends very little on the sequence. Because of that the ND 
model is very similar to the "Gaussian disorder model" (the GD model), already discussed 
in [|11|, where the Cij are i.i.d. Gaussian variables of zero mean and unitary variance. In this 
model the sequence plays no role. 
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III. METHODS 



The e-coupling method we use to calculate low-energy excitations is the one already 
used in [0 and in |jl2[. It works as follows: First of all one calculates the ground state 



structure £o = {({f} which minimizes 7i. Then one adds a perturbation to the Hamiltonian, 
H' = H — 6{1 — q), where q = = j^i ■ £o is the overlap with the ground state 

structure (note that, with this definition, the overlap is always positive). The perturbation 
term penalizes the structures which are close to the ground state £o and thus acts as a 
repulsive term in the space of structures. Finally one calculates the ground state structure 
of H' for many values of e. Let us call these new structures l^. 

By definition, for any disorder realization J' = {TZ,r]}, both the distance dj{e,L) = 
1 — is ■ £o well as the energy difference AEj{e, L) = 7i(£e) — 'H{£o) between and £o 
are non-decreasing functions of e. Moreover AEj{e, L) < e, since the Hamiltonian has been 
perturbed by a term whose absolute value is less than e, and the structures £e are then low 
energy excited states of the original Hamiltonian. We will indicate without the J subscript 



the observables averaged over the quenched disorder J = {lZ,r)}: d{e,L) = dj{e,L) and 



AE{e,L) = AEj{e,L). 

The algorithm for finding the new ground states of e-coupled system is exactly the same 
one used for the original Hamiltonian: The repulsion from the first ground state is included 
by modifying the values of the original pairing energies Cij. 

In the thermodynamical limit structures differing by a finite AE have the same intensive 
energy, and one could try to understand how they are organized in the configurational space. 
An interesting question is whether, in the large L limit, these structures are extremely close 
together or spread over finite distances. The answer to this question can be given in terms 
of the asymptotic quantity 

doo{e) = lim d{e,L) , (2) 

L— »oo 

which is again a non-decreasing function of e. If dods) = for any finite e then structures 
with the same energy are close together, while if dod^) > for e > £* ~ 0{1) then structures 
with the same intensive energy may have a broad probability distribution functions of their 
distances and overlaps. 

In the case where (ioo(^) = we can derive a relation describing the way d{e, L) vanishes. 
We assume, as in the droplet model 0], that the energy cost of a typical excitation involving 
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a finite fraction of the system (i.e. having finite d) scales with the system size as 

A^typOcL^ . (3) 

We call Il{AE, d, L) the probability distribution (over the disorder) of excitations with 
energy AE and size dL in systems of size L. For any fixed and finite d G (0, 1], we assume 
that Il{AE, d, L) has a finite weight in AE = 0, and so, for normalization reasons, we must 
have n(0, d, L) = c{d)L~'^ for large L (unless there is a delta function in AE = as in the 
D model), where c{d) is a smooth function in the scaling region. 

Once we add the perturbing term —e d to the Hamiltonian, an excitation of size d will be 
activated only if its energy satisfies AE < ed. Thus the average distance of the new ground 
state structure is given by 

rl rey rl 

d{e,L)= ydy Ii{x,y,L) dx = e y^c{y) dy , (4) 

JO JO JO 



for small e and large L \22 



Then we can evaluate the 6 exponent by two independent ways: 

• from Eq. (||), d{e,L) oc e , by measuring the average distance d{e,L) for a fixed 
small £ clS db function of the system size L; 

• from Eq. (^), which can be equivalently rewritten for the average energy difference as 

AE{d,L)(xL^ , (5) 

by measuring the average energy difference for a fixed distance (not fixed e \) as a 
function of the system size L. 



IV. RESULTS 



We study zero-temperature properties of the models described above, i.e. we analyze 
ground states structures (GSS) of the original and of the perturbed Hamiltonians. We start 
showing the data for the T = overlap distribution in the D model (the only one with many 
different ground states). After that we present the results obtained with the e-coupling 
method for all the models defined in section II. 
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FIG. 1: Probability distribution of overlaps between all the pairs of ground state structures in the 
degenerate D model. 

A. The D Model 

The D model possesses an exponentially large number of GSS, which form a set that we 
call Q. In order to understand how they are distributed in the space of structures one can 
calculate the probability distribution function of the overlap, which is defined, for any pair 
of structures, as g = J^ij^if^ij^ ■ 

Unfortunately the zero-temperature entropy of the D model is too large in order to list 
all the ground state structures for values of L large enough to be interesting. Because 
of that we have added to the D model a further constraint, suggested by observations on 
biological RNA, which strongly reduces the entropy while keeping the P{q) very similar 
in shape to the one of the unconstrained model. We avoid structures where a single base 
pairs is surrounded by non-paired bases, that is a structure with £i_ij^i = 0, iij = 1 and 
ii+ij-i = is forbidden. 

The overlap distribution, averaged over 1000 samples, is shown in figure |1]. Is worth 
noticing that the tail for small values of q is disappearing very slowly with increasing system 
size. The peak location as well as the mean overlap converge somewhere around q ~ 0.87, 
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FIG. 2: Average distance (i(e, L) versus e for different chain lengths L in the degenerate D model. 

while the variance decreases with the system size approximately as o"^ oc L^°'^^. So the 
P{q) seems to converge, in the thermodynamic limit, to a delta function centered on a 
value of q smaller than 1. Such a value is compatible with the observation that in a typical 
ground state structure the paired bases are a finite fraction (smaller than 1) of all bases. 



Nevertheless, as already explained in [T^, the triviality of the P{q) at zero temperature 
does not imply a trivial behavior of the whole low-temperature phase, and so we resort to 
the study of low-energy density of states. 

We have calculated the ground state of the Hamiltonian Ti' for 18 values of e G 
[0.001,131.072] (equally spaced on a logarithmic scale), and many L values. We have an- 
alyzed a minimum of 500 disorder realizations for the largest chain (L = 4096), and a 
maximum of circa 5 • 10^ samples for the smallest one (L = 128). 

The first GSS is chosen with uniform probability in Q (the set of all the degenerate 
ground states of the D model). When we switch on the perturbation the new GSS will be 
the one in Q having the smallest overlap with £o- Nothing else will change as long as e < 1, 
that is as long as e is not large enough to make an excited state with = 1 to become 
the new GSS. This explains the plateau for e < 1 in figure |], where we show the average 
distance between Iq and as a function of e. The main information we get from figure 




FIG. 3: Probability distribution P{qmin) of the minimum overlap among any two GSS in the 
degenerate D model. 

is the value of the plateau distance d ~ 0.38, corresponding to an overlap g = 1 — (i ~ 0.62. 
This distance can be viewed as the radius of a sphere containing the set Q of all the GSS. 
Note that the GSS are not uniformly distributed in this sphere [otherwise the P{q) would be 
peaked on a much smaller overlap value] , but they are very dense in the central region and 
very sparse on the boundaries. This means that if one chooses two GSS at random they will 
typically be very close in the dense region, giving a value of g ~ 0.87, but if one forces the 
two GSS to be as far as possible the resulting minimum overlap gmin will be much smaller, 
and will depend strongly on the specific disorder realization [see figure |^, where we plot its 
probability distribution P(gmin)]- 

As it is made clear by the results shown in figure 0, for models with high degeneracy the 
perturbing method does not work properly in the interesting region of small e, giving only 
information about the minimal overlap among GSS. Now we remove the degeneracy and 
analyze the other models. 
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FIG. 4: As in Fig. | for the QD model. 

B. The QD Model 

We have defined the QD model such as to keep as much as possible of the degenerate D 
model, even after removal of the accidental degeneracy. Here we can still distinguish two 
different regimes (see figure ^, where we plot the average distance d{e, L) as a function of e 
for different chain lengths L) : For e > 1 the data coincide with those for the (unconstrained) 
D model, while for e < 1 they have now a non trivial behavior. 

In the interesting region of small e the scaling of the data is very subtle and good results 
can be obtained either with ^ = or with 6 > 0. Our finite size scaling analysis does 
not allow us to reach a quantitative estimate, and we cannot distinguish in a statistically 
significant way among a power law (droplet like) scaling and a logarithmic scaling. Further 
and longer studies are needed to understand better this model. 

Despite the difficulties in the data analysis, we believe that the QD model has a large 
interest and relevance. Indeed it has the great advantage of a single non-degenerate ground 
state, but still the perturbation added in order to remove the original degeneracy modifies 
the energies of the structures by a quantity of order rjo ~ 0.1, thus keeping a large amount 
of information about the original energy landscape of the degenerate D model of RNA. 
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FIG. 5: As in Fig. | for the ND model. 



C. The ND Model 



In the ND model the ground state degeneracy has been removed by a random term which 
strongly reshuffles the energy levels. 

In figure |^ we show the average distance d{6, L) for some values of e and L (the error is 
of the order of the symbol size). Data are now smooth functions of e, with no singular point 
at £ ~ 1, and a finite size scaling analysis can be performed in an easier way. 

Following Eq. (^) we have rescaled the data plotting them versus e . The results are 
shown in figure where we have included all data points. The best collapse is achieved 
when using 9 ~ 0.33. The dotted line has unitary slope and clearly shows that d{e, L) oc e 
for small e and any fixed L. We notice that we are looking for a finite size scaling that works 
well only up to a given value of sL^^: The method we are using is based on the idea of 
having a "small" perturbation that acts as a probe. Very large values of e drastically change 
the Hamiltonian and the energy landscape. It is interesting to note that the scaling seems 
to work well up to (i ~ 0.4, that is the radius of the ball of degenerate ground states in the 
original D model. 

The physical interpretation of this result is the following. For any fixed e the scaling 
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FIG. 6: Average distance d{e,L) as a function of the rescaled variable eL ^ for different values of 
L in the ND model. 



variable e L^^ vanishes in the thermodynamical limit, which implies that the unperturbed 
ground state is stable against this kind of perturbation. In order to have a different and more 
complex behavior one should perturb the original Hamiltonian by a term whose amplitude 
increases with the size at least as fast as . The energy scale can thus be interpreted as 
the energy cost for reaching the first excited state. 

A still clearer picture of this phenomenon is given in figure where we plot the average 
energy difference AE{e, L) as a function of the average distance d{e, L) between the un- 
perturbed and the perturbed ground states. It is evident that, for any fixed distance, the 
energy difference is growing with the system size, according to the argument given above. 
We have rescaled the data following equation and the results are shown in figure ^. Again 
the best collapse is achieved for 9 ~ 0.33, in perfect agreement with the previous analysis. 
Also here the scaling region extends over distances up to ^ 0.5. For small distances the 
average energy increases as /\E oc (P (see the dotted line in figure H). The same behavior 
has been observed also in the QD model and we will present a simple explanation in the 
next subsection, dedicated to the GD model. 

Since the value of the 6 exponent is small we also tried to fit the data under the assumption 
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FIG. 7: The average energy difference AE between the unperturbed and the perturbed ground 
state structures as a function of their average distance d for different L values in the ND model. 
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FIG. 8: Data of Fig. |^ rescaled according to Eq. (^). 
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AE'typ oc (logI/)°, that is 6^ = 0. This behavior is suggested by mean-field solutions of 
disordered models ||20[ and more particularly from previous findings on similar models ^ |10 



The conclusion is that a logarithmic fit with a = 1.85 ± 0.15 still works rather well, but 
definitely the power law fit with 6 = 0.33 ± 0.01 is more accurate and always has a smaller 
value. 



D. The GD Model 



Under the application of the e-coupling perturbation, the GD model behaves very simi- 
larly to the ND model in that: 

• All the data perfectly collapse with ~ 1/3; 

• For small e, d oc e; 

• For small d, AE oc d"^. 

Moreover the results listed here do not depend on the presence of the constraint iij = for 
|z — j| < 4 (see section II). These findings imply that we have at hand a very simplified 
model which share the same phenomenology with more realistic models and which is more 
amenable to an analytical treatment. For sake of clarity we recall its definition: We have 
an Hamiltonian of the form given in equation (|l|), where the pairing energies Cij = eji are 
L{L — l)/2 independent Gaussian variables with zero mean and unitary variance, and the 
iij satisfy the planarity condition. 

Within this model it is easier, for example, to understand the behavior AE oc d"^ for small 
d. First of all we observe from numerical simulations that in a typical GSS £o the fraction 
of paired bases is / < 1 and the distribution of the pairing energies Cij of the active links, 
the ones with if^^ = 1, can be very well approximated by a Gaussian of negative mean and 
finite width (the distribution is truncated since positive pairing energies are forbidden in the 
GSS). Let us call the distribution of the pairing energies absolute values Pe{e). The only 
property we need for the proof is a finite weight in zero, -Pe(O) > 0, and this is the case for 
the GD model (and also for the ND model). 

Now we construct a sequence of structures £k such that AE oc rf^ for small d. £k is 
obtained from £q removing the k weakest links, i.e. those with the smallest (in absolute 
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value) pairing energies. So the distance between £o and £k is d = k/L and the energy 
difference AE is the sum of the smallest (in absolute value) k pairing energies. For large L 
we can write 

AE= r P^{e') e' de' = PM% , (6) 
Jo 2, 

where the last equality only holds for small u. The upper integration limit u is chosen such 
that k pairing energies, or equivalently a fraction 2fc/(/L) of pairing energies, are summed, 
that is 

|| = y = />e(e')de' = Pe(0)« , (7) 

where the last equation is valid for small u. Combining the above equations we obtain 
AE oc (f. 

Since we have chosen a sequence of structures which are not guaranteed to have the lowest 
possible energies, we can only argue that AE oc (P with a > 2. Nevertheless from numerical 
simulations the exponent turns out to be exactly 2 (see figure P). 

In this very simplified GD model we can make one more analytical prediction, regarding 
the fraction of paired bases in the GSS. The number of planar structures with a fraction / 
of paired bases can be easily calculated with the help of generating functions and turns out 
to be given by exp[Ls(/)], with an intensive entropy 

s(/) = -/log/-(l-/)log(l-/) + /log2 . (8) 

s{f) has a maximum for / = |, with s(|) = log 3. 

Let us now fix / and see how the energies of the exp[Ls(/)] structures are distributed. 
They look random, but actually, since the independent Gaussian random variables are only 
L{L — l)/2, there must be many correlations among them. Since any of these energies is 
the sum of fL/2 random Gaussian pairing energies, we make the approximation that the 
distribution of structure energies is also Gaussian with a variance proportional to /L, i.e. 
V{E) oc exp[— i?^/(6 /L)]. The evaluation of the coefficient h is out of our present scopes. 
Given, for any fixed /, the number of structures and the distribution of the energies, we can 
estimate the most probable lowest energy -Emin(/) through 

e-^<f) = V{E') dE' ^ e-^n.n/(^/^) , (9) 

J —oo 

where the last equality holds because -Emin is negative and large. The above equation implies 
Erainif) = " hf s{f) . In order to find the fraction of paired bases corresponding to 
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FIG. 9: As a function of the system size L we compare the ground state energy of the GD model 
(below) with the one reached by a simple greedy algorithm (above). The horizontal lines correspond 
to the infinite size extrapolations. 

the GSS one has to minimize i?min(/), or equivalently maximize fs{f), over /. Such an 
extremum is achieved for / = 0.86 to be compared with the fraction of paired bases found 
numerically / = 0.856. The rather small discrepancy tells us that the approximation made 
on the form of the structure energies distribution V{E) is not so bad. 

The apparent correctness of such a simple approximation could suggest that the GD 
model has a trivial energy landscape. We have checked for this possibility with the following 
method: In a trivial energy landscape any reasonably smart greedy algorithm should be able 
to reach, or at least to closely approach, the ground state energy. We have used a greedy 
algorithm which builds up the structure in the following way: It starts with a structure with 
no links, at each step it chooses the lowest negative pairing energy (largest in absolute value) 
among the set of those allowed by the planarity condition, and adds the corresponding link 
to the growing structure. Using this greedy algorithm we can reach the energies shown in 
figure 1^ which are more than 10% higher than the corresponding ground state energies. So, 
it seems that finding a structure with low energy in linear time is not an easy task. This 
suggests a complex energy landscape. A deeper analysis is obviously needed in order to say 
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how much complex the energy landscape of the GD model is. 



V. CONCLUSIONS 

Our results allow to describe a clear and simple physical picture for the RNA-inspired 
models studied here. All of them possess a glassy phase at low enough temperatures (since 
we have analyzed very low energy density of states we cannot make predictions on the 
location of the critical temperature). We can claim that our study clearly selects a positive 
9 exponent value for all cases but for the quasi degenerate QD model, where our numerical 
results are not precise enough to allow us quantitative statements. 

At variance with the results of |jlO|, |ll|] we find that in the non-degenerate ND model the 



broken phase does not look marginal, but a standard droplet glassy phase with 1/3 > 
0. Our way of analyzing the data allows us to exclude (with good confidence) a simple 
logarithmic divergence of the energy difference between ground state and excited states. On 
this issue we agree with the results of [jl2[: The difference in the estimate we give for 6', as 
compared to the 6 ~ 0.23 of [0, is probably due to the different fitting procedure, and to 
the number of free parameters used in the fit. 

The 9 exponent we find is perfectly compatible with that for directed polymers in random 
media in 1+1 dimensions [0, 9£)prm = |- Since the two models have some similarities this 
relation could indeed hide a deep connection. 

The degenerate D model and the quasi degenerate QD model we have defined above are 
maybe the less trivial and the most intriguing from the theoretical point of view. Unfor- 
tunately we were not able to determine accurately the asymptotic scaling behavior in the 
latter. 

It is probable, on the contrary, that the most part of the analytic developments will be 
obtained for the Gaussian disorder GD model, that is by far the simplest among all the 
models with a non-trivial behavior. 
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